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Abstract 

For a set of integers X, we define a g-ary Z-cycle to be a assignment 
of the symbols 1 through q to the integers modulo q n so that every 
word appears on some translate of X. This definition generalizes that 
of de Bruijn cycles, and opens up a multitude of questions. We address 
the existence of such cycles, discuss "reduced" cycles (ones in which 
the all- zeroes string need not appear), and provide general bounds on 
the shortest sequence which contains all words on some translate of X. 
We also prove a variant on recent results concerning decompositions 
of complete graphs into cycles and employ it to resolve the case of 
\X\ = 2 completely. 

1 Introduction 



A de Bruijn cycle of order n is a g-ary sequence (S(0), . . . , S(q n — 1)) so that 
every g-ary n-word appears in a "window" (S(j), . . . , S(j + n — 1)) for some 
j (indices taken modulo q n ). A reduced de Bruijn cycle is a string of length 
q n — 1 which achieves every n-word in some window, except for the word 
n . In this paper, we are concerned with such objects when the notion of 
"window" is generalized. 
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Given a sequence X = {i 3 -}" =1 C Z 9 n, we say that the map x '■ ~~ > [o\ 
(resp., x '■ ^V-i — > [?]) is an X-cycle (X*-cycle) if, for every word W G [g] n 
(resp., every word W G [q] n \ {0 n }), there exists a t G Z g n (resp., t G Z, 9 n„ 1 ) 
so that x(ij +t) — W(j). If such a sequence exists for X, we say that X is 
g-valid (resp., g*-valid). We will often abuse notation by writing x^X) f° r 
the n-word (x(h), ■ ■ ■ , x(*n))- Furthermore, we a/so use "cycle" to refer to 
sequences of edges in a directed graph each of whose tail is the head of the 
previous one, and which return to their starting point. It should always be 
clear from context which of these definitions we intend - though, often, the 
notions will coincide! 

It is classical that an X-cycle and an X*-cycle exist for X = {1, . . . , n}. 
In Section |2J we address the validity of other sets X. As it turns out, the 
question is rather difficult in general, and we solve the problem completely 
only for sets of cardinality 2. We present some general constructions and a 
number of computational results, and we discuss a graph-theoretic question 
whose solution is equivalent to the case of X being an arithmetic progression. 
In the next section, we discuss the existence of reduced de Bruijn cycles, and 
have a greater degree of success in characterizing the g*-valid sets. Then, in 
Section HI the issue of "approximate" cycles is discussed, and we present a 
nearly optimal bound on their length. The following section contains a proof 
of a graph-theoretic decomposition result that is used in Section |21 and which 
solves a variant of a family of problems that has appeared recently in the 
literature. We finish with a number of open questions and suggestions for 
future investigation. 

2 Unreduced Cycles 

It is easy to see that, to determine the two-element q- valid sets, we need only 
examine the sets {0, d} with d\q 2 . Indeed, for any k G , if there exists 
an X-cycle x f° r {0,d}, then x' '■ s x(k~ ls ) is an X-cycle for {0, kd}. In 
addition, it is clear that the validity of X is equivalent to the validity of X + b 
for any 6 G Z r The same arguments apply to sets X whose elements are in 
arithmetic progression: we need only examine the cases when the difference 
d divides q n . 

Suppose, then, that d\q n , and consider D™, the n th g-ary de Bruijn di- 
graph, i.e., the digraph whose vertices are the g-ary n-strings and which has 
an edge from x to y if the last n — 1 symbols of x are the same as the first 
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n — 1 symbols of y. (Note that some edges have loops attached.) Then the 
set AP{n, d) = {0, d, 2d, . . . , {n — l)d} is g-valid iff there is a partition of 
the edges of -D™ -1 into d cycles each of length q n /d, because we may write 
x(j) = C a (b), where C a is the a th such cycle and j = ad + b. 

Using Dg , which is simply a complete directed graph with loops on q ver- 
tices, we may state a condition equivalent to the q- validity of {0, d}: that D x q 
is fully decomposable into cycles of length q 2 /d. Theorem 1191 which appears 
Section says that this is possible precisely when q 2 /d > 2. Therefore, we 
have 

Theorem 1. There exists an X-cycle for X = {0, d} if and only if q 2 /d ^ 2. 

The situation for sets with |X| > 2 appears significantly more complicated, 
even for arithmetic progressions. However, the above invalidity result for 
d = q 2 /2 has an immediate analogue for larger n: 

Proposition 2. For any r, q, with r\q, the set AP(r,q r /r) is q-invalid. 

Proof. Suppose an X-cycle \ existed. Then, we may assume without loss of 
generality that (x(0), • • • , x(( r — l)g n / r )) is the all-zeroes vector. But, then 
(x(?V r )> x(2<l r / r ), • • • j x(0)) is also, a contradiction. □ 

On the other hand, we can construct a large family of AP(n, g)-cycles. 
Form the quotient graph G{n) from by identifying two vertices x and x' 
of if x — x 1 = k for some k G Z g , i.e., X{ — x\ = k for 1 < i < n. 

Fact 1. G(n) is isomorphic to -D? -1 - 

Proof. For x = (xi, X2, ■ ■ ■ , x n ), consider the map 

A: X h-> (xi -x 2 ,x 2 - X 3 , . . . ,X n -i -x n ). 

It is easy to check that A is well-defined on G(n), invertible and preserves 
directed edges (i.e., (x, y) is an edge in D™ if and only if (A(x), \(y)) is an 
edge in G(n)). Note that the inverse map A" 1 doesn't necessarily preserve 
cycles, though. However, it is not hard to show the following. Suppose that 
(g , qi, ■ ■ ■ , q r -i) is a cycle in G(n) = D^ 1 , i.e., 

((<7i+l ; Qi+2, ■ ■ ■ , Qi+n-l), (<?i+2, qi+3, ■ ■ ■ , Qi+n)) 

is an edge for all i, where the indices are reduced modulo r. Then this cycle 
"lifts" under A -1 to a cycle in D™ if and only if Y2l=o 9i = (mod q). 
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Observe that any Eulerian cycle C in D™~ 1 satisfies this sum condition. 
Hence, it lifts to a cycle C + in D™ going through exactly one of the q points 
in each equivalence class. In fact, we can form q disjoint cycles {C^"}' =1 from 
this cycle C + by translating each point in it by some fixed constant q G 7L q . 

Finally, we can form a cyclic sequence S in D™ containing all of its vertices 
by "splicing together" these q cycles Cj~ in the obvious way. Since C was in 
fact a de Bruijn cycle for (n — l)-tuples, then it easily checked that S is an 
X-cycle with X = {0, q, 2q, . . . , (n — l)q}. 

As an example, consider the cycle 001122021 for n = 2, q = 3. We can lift 
this to 100021200 for n = 3,q = 3, form the two translates 211101011 and 
022210122 and splice them together to get 021210210210102021102210210, 
which is a {0, 3, 6}-cycle. 

In fact, since there are many ways of choosing the first de Bruijn cycle 
([(q — l)\] qn ■ q qn to be precise), and many ways of splicing them 

together (q\), there quite a few ways of producing such cycles. Unfortunately, 
it is not possible to iterate this construction, since the cycles do not 
themselves have the zero-sum property needed to lift them again. 

Proposition 3. AP(n,q) is q-valid for any n, q. 

Suppose that {0, d, 2d} is g-valid, where 8\k = q 3 /d. Then there exists 
a decomposition of the edges of D 2 q into cycles of length k. Write £ for 
the set of even symbols in [2q] and O for the set of odd symbols. We may 
think of D% as being composed of four parts: U\ = £ x £, U 2 = £ X O, 
U 3 = O x O, and = O x £. U 2 and U4 contain no edges; U\ and U 3 are 
copies of D 2 q . We may therefore decompose U\ and U 3 into fc-cycles. The 
remaining edges may be decomposed into 4-cycles and 8-cycles as follows. 
For each pair (a, b), (b, c) with a,b G £ and c G O, define a cycle C(a, b, c) = 
((a, b), (b, c), (c, a + b + c), (a + b + c, a)), with addition modulo 2q. The 
resulting 4-cycles partition all edges which do not belong to U\ x Ui, U 3 x U 3 , 
U2 x U4, or U4 times \J 2 - The edges in these final two classes come in pairs 
{(b, c), (c, b)}. We may attach {(6, c), (c, b)} and {(26 + C+2, b), (b, 2b + c + 2)} 
to the cycle C{b + 2, b, c) for each b even and c = 1 (mod 4), thus turning it 
into a 8-cycle. Doing so accounts for all the remaining edges exactly once. 

The result is a set of 8-cycles and 4-cycles. We may partition them into 
classes so that each class has exactly k edges, and "join" each class at TJ\ 
into a cycle of length k. The resulting decomposition of -D^j gives rise to a 
{0, 8d, 16<i}-cycle. Hence, we have the following. 
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Proposition 4. If AP(3, d) q-valid, where 8\q 3 /d, then AP(3,8d) is 2q- 
valid. 

Corollary 5. AP(3, 8 fc ) is r2 k+1 -valid for all k > and r > 1. 

Proof. Begin with the 2r- valid {0, 1, 2} and iterate the above proposition. □ 

To illustrate the complicated nature of the d > 2 case, we offer the fol- 
lowing computational observations. By "affine equivalence" , we mean a map 
o~ : s —> ks + b for some k G Z*„ and b G Z ? n . Clearly, the partition of index 
sets into valid and invalid is refined by the partition into affine equivalence 
classes. 

1. For (q, n) = (3, 3), the only invalid index sets are {k, k + 9, k + 18} for 
k = 0...8. 

2. The only 2-invalid 3-set (up to affine equivalence) is {0,1,3}. For 
(q, n) = (2, 4), the only valid X's (up to affine equivalence) are the nine 
sets {0, 1, 2, 3}, {0, 1, 2, 6}, {0, 1, 2, 7}, {0, 1, 3, 4}, {0, 1, 3, 7}, {0, 1, 3, 8}, 
{0, 1, 3, 9}, {0, 1, 3, 14}, and, of course, {0, 2, 4, 6}. 

3. For (q, n) = (2, 5), the following list contains one representative of each 
equivalence class of invalid index sets, in lexicographic order: 



0,1,2,3,12 


0,1,2,4,7 


0,1,2,4,9 


0,1,2,4,12 


0,1,2,4,23 


0,1,2,4,24 


0,1,2,4,25 


0,1,2,4,26 


0,1,2,4,27 


0,1,2,5,7 


0,1,2,5,8 


0,1,2,5,9 


0,1,2,5,13 


0,1,2,5,14 


0,1,2,5,15 


0,1,2,5,16 


0,1,2,5,19 


0,1,2,5,20 


0,1,2,5,21 


0,1,2,5,22 


0,1,2,5,24 


0,1,2,5,25 


0,1,2,5,26 


0,1,2,6,9 


0,1,2,6,11 


0,1,2,6,13 


0,1,2,6,14 


0,1,2,6,15 


0,1,2,6,16 


0,1,2,6,17 


0,1,2,6,19 


0,1,2,6,21 


0,1,2,6,23 


0,1,2,6,25 


0,1,2,6,26 


0,1,2,7,11 


0,1,2,7,14 


0,1,2,7,15 


0,1,2,7,19 


0,1,2,7,22 


0,1,2,7,23 


0,1,2,7,24 


0,1,2,8,12 


0,1,2,8,13 


0,1,2,8,16 


0,1,2,8,17 


0,1,2,8,18 


0,1,2,8,19 


0,1,2,8,23 


0,1,2,8,24 


0,1,2,8,25 


0,1,2,9,13 


0,1,2,9,14 


0,1,2,9,15 


0,1,2,9,16 


0,1,2,9,17 


0,1,2,9,19 


0,1,2,9,20 


0,1,2,9,21 


0,1,2,9,22 


0,1,2,9,25 


0,1,2,10,14 


0,1,2,10,15 


0,1,2,10,16 


0,1,2,10,17 


0,1,2,10,18 


0,1,2,10,20 


0,1,2,11,13 


0,1,2,11,14 


0,1,2,11,15 


0,1,2,11,16 


0,1,2,11,19 


0,1,2,12,15 


0,1,2,12,17 


0,1,2,12,19 


0,1,2,13,16 


0,1,2,13,18 


0,1,2,13,20 


0,1,2,14,17 


0,1,2,15,18 
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0,1,2,16,18 


0,1,3,4,9 


0,1,3,4,11 


0,1,3,4,12 


0,1,3,4,15 


0,1,3,4,16 


0,1,3,5,9 


0,1,3,5,11 


0,1,3,5,12 


0,1,3,5,13 


0,1,3,5,15 


0,1,3,5,17 


0,1,3,5,21 


0,1,3,5,22 


0,1,3,5,24 


0,1,3,5,25 


0,1,3,5,26 


0,1,3,7,9 


0,1,3,7,11 


0,1,3,7,12 


0,1,3,7,15 


0,1,3,7,16 


0,1,3,7,17 


0,1,3,7,19 


0,1,3,7,23 


0,1,3,7,24 


0,1,3,7,27 


0,1,3,7,30 


0,1,3,8,10 


0,1,3,8,12 


0,1,3,8,14 


0,1,3,8,16 


0,1,3,8,17 


0,1,3,8,19 


0,1,3,8,20 


0,1,3,8,21 


0,1,3,8,23 


0,1,3,8,24 


0,1,3,9,13 


0,1,3,9,16 


0,1,3,9,17 


0,1,3,9,20 


0,1,3,9,25 


0,1,3,9,26 


0,1,3,9,28 


0,1,3,9,30 


0,1,3,10,12 


0,1,3,10,13 


0,1,3,10,14 


0,1,3,10,15 


0,1,3,10,16 


0,1,3,10,20 


0,1,3,10,23 


0,1,3,10,30 


0,1,3,12,13 


0,1,3,12,16 


0,1,3,12,24 


0,1,3,12,25 


0,1,3,12,27 


0,1,3,12,28 


0,1,3,13,15 


0,1,3,13,21 


0,1,3,13,22 


0,1,3,13,25 


0,1,3,13,27 


0,1,3,13,28 


0,1,3,14,15 


0,1,3,15,16 


0,1,3,15,20 


0,1,3,15,21 


0,1,3,15,22 


0,1,3,15,23 


0,1,3,15,25 


0,1,3,15,27 


0,1,3,15,28 


0,1,3,16,17 


0,1,3,16,19 


0,1,3,16,21 


0,1,3,16,23 


0,1,3,16,25 


0,1,3,16,27 


0,1,3,17,19 


0,1,3,17,21 


0,1,3,17,23 


0,1,3,17,25 


0,1,3,17,27 


0,1,3,17,28 


0,1,3,19,27 


0,1,3,20,24 


0,1,3,21,24 


0,1,3,21,25 


0,1,3,22,24 


0,1,3,22,25 


0,1,3,23,24 


0,1,3,23,25 


0,1,3,23,27 


0,1,3,24,28 


0,1,3,25,27 


0,1,3,27,28 


0,1,4,5,13 


0,1,4,6,12 


0,1,4,6,14 


0,1,4,6,17 


0,1,4,6,18 


0,1,4,6,20 


0,1,4,8,23 


0,1,4,9,15 


0,1,4,9,16 


0,1,4,9,17 


0,1,4,9,20 


0,1,4,12,14 


0,1,4,12,18 


0,1,4,13,14 


0,1,4,14,17 


0,1,4,14,28 


0,1,4,14,29 


0,1,4,15,16 


0,1,4,15,20 


0,1,4,15,23 


0,1,4,15,28 


0,1,4,16,26 


0,1,4,17,26 


0,1,4,18,26 


0,1,4,26,28 


0,1,5,7,16 


0,1,5,8,16 


0,1,6,8,17 


0,1,7,8,17 


0,1,7,9,15 


0,1,7,9,16 


0,1,7,9,17 


0,1,7,15,16 


0,1,8,16,17 


0,1,8,16,24 


0,2,4,8,14 


0,2,4,10,14 


0,2,4,10,18 


0,2,4,10,24 


0,2,4,10,26 


0,2,4,16,20 


0,2,6,18,22 


0,2,8,16,18 


0,2,8,16,24 


0,4,8,16,24 





3 Reduced Cycles 

Although the definition of g*-validity certainly makes sense when q is not a 
prime power, we restrict our attention to that case in this section. Therefore, 
consider q > 2 a fixed prime power, and take our alphabet to be ¥ q . Let a 
be a generator of the multiplicative group of the finite field ¥ q n. Denote by 
£ the elementary basis for over ¥ q . Given a basis B = {pi, . . . , b n } of ¥ q n 
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over ¥ q and an element 7 £ ¥ q n, write fei'j) for the element of F™ whose 
j th coordinate is the coefficient of bj in the ^-representation of 7. Then, 
given a nonzero vector v £ F™, define A(a,$, v) to be the string whose j th 
coordinate (i.e., Aj(a,B, v), < j < g n — 2) is v T /g(a J ). 

It is well known that, when B = {a- 7 : < j < n — 1} and v has only one 
nonzero coordinate, A(a,B, v) is a reduced de Bruijn cycle of order n (e.g., 
jH].) We generalize this result as follows. 

Proposition 6. Let X = {ij}™ =1 be a sequence of distinct integers. Fix a 
basis B o/F" over ¥ q , a generator a £ ¥ qn , and a vector v £ F™ and wnie 
$(£) /or t/ie vector 

(A h+t (a,B,v), . . . ,A in+t (a,B,v)y £ F£ 

w?£/i indices taken modulo q n — 2. // i/ie minimal polynomial of a is not 
a divisor of Y^=i c o x%1 f or an V nonzero (ci, . . . , c n ), then the map \I/ which 
sends to and a* to $(t) is an isomorphism from the additive group of¥ q n 
to ¥ n q . 

Proof. First, we show that \1/ is linear. Write Cj for the elementary n- vector 
whose coordinates are all zero except for a 1 in the j th coordinate. We denote 
by M 7 g the matrix representing multiplication by 7 £ ¥ q n in the B basis. It 
is easy to see that 

A k (a,B,v) = ^f B (a k ) 

and therefore that 

n n 

*(t) = E e ^(« !j 7) = J2 e ^ T KBHi), (i) 

3=1 i=i 

which is obviously linear. 

Now, suppose that ^(7) = and 7 = oA If we denote by S the subspace 
of F™ orthogonal to v, then we have a lj+t £ for each j. However, 

fs is linear and has a trivial kernel, so all the lie in a subspace of 

¥ q of dimension n — 1 and are therefore linearly dependent. Since M Q j g is 
nonsingular, this implies that {a ij }j =1 is a dependent set. But then we have 

3=1 

for some nonzero (ci, . . . , c n ), a contradiction. □ 
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The map 7 1— > M 7j g is actually an isomorphism of fields. The image is 
a set of matrices which form a field, i.e., a matrix field. These objects have 
been studied extensively and thoroughly characterized when the matrices 
take their entries from a finite field ((!]). 

Corollary 7. If the minimal polynomial of a, a multiplicative generator 
of ¥gn, is not a divisor of Y^j=i c j xli f or an V nonzero (ci,...,c n ), then 
A(a,£>, v) is an X* -cycle. 

Proof. By the above argument, A (a, B, v) contains all nonzero n-strings in 
shifted copies of the index set X. □ 

We require another definition. 

Definition 8. The index set X = {i,}™ =1 is called exceptional for q if, for 
every primitive polynomial g G F q [x] of degree n, there exists a nonzero vector 
(ci, . . . , c n ) G F™ so that g divides 

n 

( 2 ) 

i=i 

Equivalently, if for every a a generator of¥* n , the set {a lj } is linearly de- 
pendent over F q , then X is exceptional for q. An index set which is not 
exceptional is called ordinary for q. 

Note that the exponents in (J2J) can be thought of as belonging to Z^n^. 

Proposition 9. X is q* -valid whenever X is ordinary for q. 

Proof. If X = {ij}" =1 is not exceptional, then there exists a primitive poly- 
nomial g G ¥ q [x] of degree n so that, for all nonzero (c\, . . . , c n ), g is not a 
divisor of Yll=i c j x%j ■ Since x is not a root of Yll=i c j x%i i n Fjxj/g, but it 
is a multiplicative generator of this field, A(x,B, v) is an X*-cycle for any B 
and v. □ 

Which index sets are ordinary? We argue that if (q n — l,d) = 1, then 
(a, a+d, a+2d, . . . , a+(n— l)d) is ordinary. It is clear that, if Xis ordinary, all 
of its translates are as well. We may therefore assume that a = 0. Then some 
irreducible polynomial g of degree n divides Y^jZo c j x ^ d f° r eacn ( c i> • • • > c n) 7^ 
0. Let a be a root of g, so a d is a root of ^j=o Cj^" 7 - That this polynomial 
has degree less than n contradicts the fact that a d is a generator of F^ n . 
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It is trivial that X is ordinary if it is a singleton. If X has two elements, 
then it is easy to see that X = {i, j} is ordinary if q + 1 \ % — j, since then 
a l ~ j G" F q . Indeed, the copy of F* lying inside of F* 2 is the set 
for any generator a. Conversely, if q + l\i — j, then, for every generator a, 
we have a 1 = ca^ for some c G F q . Therefore, a two-element set is ordinary 
for q if and only if the difference of the elements is not a multiple of q + 1. 

For a prime p and a positive integer n, define the Jacobi logarithm as 
follows: for a generator a of F*„, define L a : Z p n_! \ {s} i— > Z p n_i \ {0} by 
1 + a 1 = a La ^\ where s = (p n — l)/2 if p > 2 and s = otherwise. 

Proposition 10. A three-element set {i,j,k} is exceptional if and only if 
either: 

1. Q\j- i, 

2. Q\k-j, 

3. Q\k-i, 

4- or, for all m with (m, q 3 — 1) = 1, there exists an a so that 
L a (aQ + m(j — i)) = m{k — i) mod Q, 

where Q = q 2 + q + 1, and a is any (fixed) generator ofF* :i . 

Proof. There are only two ways that a 1 , a? , and a k can be linearly dependent. 
Either one of them is an F 9 -multiple of another, or, for some triple {ci, C2, C3}, 
with Ci ^ for all i, 

cxd + c 2 a j + c 3 a k = 0. (3) 

The former case is precisely the divisibility conditions stated above. To see 
that the latter situation is equivalent to condition |3J we may rewrite (jSJ) 
without loss of generality as 

c 4 o/- fe + c 5 a j ~ k = 1 (4) 

with c 4 , c 5 G F g . Suppose {i,j, k} falls into this case, i.e., has a solution 
in C4 and C5. Since we have assumed that neither c 4 nor C5 is zero, we may 
express each of them in terms of a: respectively, a s ® and a 1 ®, for some s,t 
integers. Then we may rewrite as 

^sQ+i-fc^ _|_ Q,Q(*- s )+i-^ = a sQ+i-k+L a (Q(t-s)+j-i) _ ft 
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which is to say, that sQ + i — k + L a (Q(t — s) +j — i) =0 mod q 3 — 1. (Note 
that the fact that L a is not defined on all of "L q 3-\ is not problematic, since 
the left hand side of (@J) cannot be zero.) This equation has a solution in s 
and t iff there exists an a so that L a (aQ + j — i) = k — i mod Q. 

If we choose any other generator j3 of F* 3 , there is some m such that 
(m, q 3 — 1) = 1 and a m = (3. Then Lp{a) = b iff L a (am) = bm, so {i, j, /c} is 
exceptional iff z, Q\k—j, Q\k—i, or, for some m such that (m, g 3 — 1) = 1, 
there exists an a so that 

L a (aQ + m(j — i)) = m{k — i) mod Q. 

□ 

4 Approximate Cycles 

Since the question of whether an X-cycle exists appears difficult in general, 
we may ask instead whether it is possible to find an "approximate" X-cycle. 
This question comes in two forms for an index set X = {ij}'j =1 C Z: 

1. What is the least N for which a x '■ Ztv — > [q] exists so that, for every 
word W, there exists an m with W = x(X + m)? 

2. What is the least N for which there exists a x so that all but o(q N ) 
words appear as m ) f° r some ml 

Call the former object a Type / approximate cycle and the latter a Type 77 
approximate cycle. Then we can show: 

Theorem 11. Let F{n) be any function so that q~ n F(n) — ► oo as n — ^ oo. 
T/ien i/iere exzsfo a q-ary Type II approximate X-cycle of length F{n) when 
|X| = n. 

We need a result of Janson to proceed. The following appears in 
First, some notation. Let I be an index set for a set of events {Bi} ie j. 
Define a graph ~ on I with the following property: Let J\ and J2 be two 
disjoint subsets of I such that there is no i\ G J\ and i 2 G J2 with z'i ~ i 2 . 
Now, let A 1 be any Boolean function of the events {Bi : i G Ji} and let A 2 
be any Boolean function of the events {Bi : z G J 2 }- Then A 1 and A 2 are 
independent. 

Let n = EZiVW, A = E^P(^ A B,), and 5 = max, ^ P( /; ? ). 
Then the following holds. 
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Lemma 12. With the above notation, 

P(A£ 1 S i ) < exp(-min 



fi 2 fi fi 
8A' 2' 65 



Proof of Theorem\ll\ Fix an integer m. We wish to show that (for a suitable 
choice of m) the expected number of g-ary n-words which do not appear as 
x(X + j) for any j G Z m for a random function x '■ — > <? is o(q n ). This 
quantity is g n times the probability that a single word - say, n - does not 
appear anywhere. So we need only show that this probability is o(l). Let Bj 
be the event that x{t + j) = W. Then define a graph on the Bj as follows: 
Bj ~ 5 fc iff (X + j) n (X + k) ^ 0. Note that deg(5 i ) < n 2 for all j, and 
P(-Bj) = g~ n . Therefore /i = mq~ n , 

A < g n n 2 g" n = n 2 

and 8 < n 2 q~ n . Plugging into the lemma, we find 

m 2 m m 



P(A^ 1 i? J ) < exp(-min — — 



n 2 q n 2q n Qn z 

If we let m = F(n), then min(m 2 /8n 2 g n , m/2q n , m/6n 2 ) — - > oo, completing 
the proof. □ 

Now, we address the problem of constructing Type I approximate X- 
cycles. First, for a set of reals X = {xj}" =1 , define 

/i(X) = maxmin || a(xj — Xj~)\\, 

where ||x|| denotes the distance to the closest integer. It is proven in |U] that, 
for any set X of cardinality n, fi(X) > n~ 2 . Then we have the following. 

Lemma 13. For any X = {i,}™ =1 C Z and collection of q-ary n-words 
Wi, . . . , Ws, there exists an integer p = n 2 S'(l + o(l)) and a map x '■ Z p —> [q] 
so that every Wj appears as x(X + t) for some t £ Z p . 

Proof. Let p be the smallest prime greater than n 2 (S + 3), and choose a 
achieving the bound fi(X) > n~ 2 . Then there exists a k G Z p so that 
| a — k/p\ < p' 1 , and it follows that, for any j ^ k, kij and kik are separated 
by at least pn~ 2 — 3 > S integers modulo p. Write W(k) for the k th symbol 
of the word W. Then we may define x '■ ~^ [q] by xi^ij +t) — W t (j) for 
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1 < t < S and 1 < j ' < n, since the minimum gap between elements of kX is 
at least S. Define x( s ) arbitrarily for s ^ kij + 1 for any j and t. Then the 
map x' '■ s ~^ x{ks) has the desired property. □ 

We could use this result immediately to achieve a length n 2 q n (l + o(l)) 
Type I approximate X-cycle, but it possible to do better using the random 
construction above. There is a trivial lower bound of q n on the length of any 
approximate cycle, and we can show an upper bound that is only slightly 
worse: 

Theorem 14. For any X = {ij}J =1 C Z ; there exists a q-ary Type I approx- 
imate X-cycle of length p = (8 + o(l))g n logn. 

Proof. The basic idea is to take a random sequence 7\ that contains almost 
all words, then use Lemma fTSl to "tack on" the remaining ones. We use the 
notation of the proof of Theorem 1111 If we let F(n) = [4g n logn], then, for 
sufficiently large n, the expected number of words which do not appear as 
x(X + t) is at most 

n 

g n exp(-21ogn) = — . 

n z 

Apply the lemma to find a sequence T<i in which each word missed by the 
random sequence occurs. Then, we concatenate two copies of T\ with two 
copies of T2 (two copies are used to avoid disturbing sequences which "wrap 
around"), and the result is a Type I approximate X-cycle of the stated length. 

□ 



5 Decompositions into Directed Cycles 

A number of recent papers (most notably PQ,|2], an d 0) have addressed (and 
solved) the problem of decomposing a complete (possibly directed) graph into 
a set of cycles of prescribed length. Generally speaking, so long as the cycle 
lengths add up to the number of edges, there are very few impediments to the 
existence of such decompositions - although demonstrating this is far from 
simple. None of this work has dealt with graphs containing loops, however; 
in this section, we address this situation, in the case when all the cycles have 
the same length. 

Let K n denote the complete directed graph with loops on n vertices, i.e., 
the vertex set is [n] and the edge set is [n] x [n], and let K n denote the 
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complete directed graph without loops. We wish to know, for which n and 
d is it possible to decompose the edge set of K n completely into cycles of 
length d? Clearly d must divide n 2 . However, our main result states that 
the only other obstruction is that d > 2. (Indeed, it is easy to see in these 
cases that such a decomposition is not possible.) 
First, we state a result from j^j. 

Theorem 15. IfYl t i=i m i = n ( n ~ -0 an d m i — 2 f or i — 1, ■ ■ ■ ,t, then K n 
can be decomposed as the edge- disjoint union of cycles of lengths m±, . . . , m t , 
except in the case when n = 6 and all rrii = 3. 

This will imply the following: 

Proposition 16. If d\n 2 , d = 6 or d > 8, then K n may be decomposed into 
cycles of length d. 

Proof. We offer a procedure for "packing" length d cycles first into G, then 
the Cr/s, then H. Split the vertex set of K n into two pieces: U = {a, b} and 
V = {1, . . . ,n — 2}. We may then decompose the edge set of K into the 
following pieces: 

1. one K 2 on U, 

2. one K n -i on V, 

3. and n — 2 graphs each of which has vertex set {a, b,j} for some j £ V, 
with a loop at j and edges (j, i) and for i = a,b. 

Such a decomposition is possible because d > 6 implies n > 2. 

Call the first graph H, the second G, and the third Gj. Denote by {u, v} 
the pair {(u,v), (v,u)}. Suppose ("j 2 ) = r m °d d with 1 < r < d. Suppos- 
ing r > 1 , by Theorem EE we may decompose G into K = [(( n ~ 2 ) - l)/dj 
length d cycles and one r-trail, which we call T. We may assume, without loss 
of generality, that T meets vertex 3. Let m = \_(d — r)/5j and d' = d—r—bm. 
We construct a graph X as follows. Take X to be the union of T, Gj for 
j = 1, . . . m — 1, and one of the following, according to the value of d' . Note 
that d' < 4 implies m < n — 3, since otherwise the number of remaining 
edges (i.e, ones unaccounted for thus far by X or any of the cycles in K n _ 2 ) 
would not be divisible by d. Similarly, d! = 4 implies either m < n — 3 or we 
may add in all the remaining edges of the graph and be done. 
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• d! = : Add G m to X. 

• d! — 1 : If m = 0, add the edge (1, 1) to X. If m > 0, add the edges 
{a, m}, (m, m), {a, m + 1}, and (m + 1, m + 1). 

• d' = 2: Add {a,m + 1}. 

• d! = 3 : Add {a, m + 1} and (m + 1, m + 1). 

• d! = 4 : Add {a, m + 1} and {a, m + 2}. 

Note that in all cases, X is connected and has equal in- and out-degree at 
every vertex; therefore, X is Eulerian and may be written as a cycle of length 
d. 

Now, if not all edges of K n have been accounted for, yet, we wish to add 
another length d cycle. At the previous step, there are a few possibilities for 
the remaining set of edges not assigned to cycles. The set contains H, some 
of the Gj, as well as either: 

1. Case I: {a, m}, {b, m}, 

2. Case II: {b, m}, {b, m + 1}, 

3. Case III: (m, m), {b,m}, 

4. Case IV: {b, m}, or 

5. Case V: (m, m), {b, m}, (m + 1, m + 1), {b, m + 1}. 

In each of the cases, let Y" be the set of edges listed above. We now let 
m = \_(d — |y|)/5j and d! = d — 5m — \ Y\. We may proceed exactly as above 
with the construction of X, unless m — 0. If m — 0, then either > 2 or 
<i' = 0. In the latter case, d = 6 and we are in Case V, so we may simply 
take X = Y. In the former case, we append Y, as well as d' > edges drawn 
from the first full "unused" Gk as follows. (Note that, if there is no unused 
Gk-, we may append the remaining edges in the graph and be done.) 

• d' = 2: Add {b, k}. 

• d! = 3 : Add {b, k} and (k, k). 
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• d! = 4 : If < n — 2, add {6, A;} and {b, k + 1}. It is not possible for 
k = n — 2, since there would be too few edges left for d to divide them 
evenly. 

Again, the resulting graph X is Eulerian, and can therefore be written as 
a length d cycle. It is clear that we may repeat the previous step until a full 
decomposition is achieved, though we may need to switch the roles of a and 
b. 

It remains to deal with the case of r = 1 at the beginning of the proof. 
Instead of decomposing into all length d trails except for one r-trail, we 
instead create one length d — 1 trail and one length 2 trail. Then attach (a, a) 
to the d—1 trail (changing labels if necessary to maintain connectivity), 
and proceed as above with r = 2, using the remaining edges of H when 
necessary. All of the details of the preceding argument work with this slight 
modification. □ 

Proposition 17. If n is even, then K n may be decomposed into cycles of 
length 4. 

Proof. There are two cases: n = 2 mod 4 and n = mod 4. Suppose the 
former. Then the following types of edge-sets partition E(K n ) into cycles of 
length 4: 

1- (j + n/2,j + n/2), (j,j + n/2), (j + n/2,j)} for each j with 1 < 

j < n/2, 

2. { (j, j + 2k- 1), (j + 2k- l,j), (j, j + 2k), (j + 2k, j)}, for each j and 
each k so that 1 < k < (n — 2)/4. 

If 4|n, we use the following types of sets instead: 

1- (j + n/2,j + n/2), (j,j + n/2), (j + n/2,j)} for each j with 1 < 

j < n/2, 

2. { (j, j + 2k), (j + 2k, j), + 2k + 1), (j + 2k + l,j)}, for each j and 
each k so that 1 < k < n/A — 1. 

3. {{2j, 2j - 1), (2j - 1, 2j), (2j, 2j + 1), (2j + 1, 2j)}, for each j with 1 < 
j < n/2. 

□ 
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Proposition 18. If d\n, d = 3, 5, or 7, then K n may be decomposed into 
cycles of length d. 

Proof. We imitate the case of d > 8 here, only the situation is simpler. 
Suppose d = 3. We may decompose K n into three pieces: the loop (a, a), a 
K n _i, and n — 1 length 3 cycles G\, . . . , G n _i. Applying Theorem [T^l we 
may break the second of these into cycles of length 3, except for one of length 
2, which we call T. (Since 3|n 2 , (n — l)(n — 2) = 2 mod 3.) We may assume 
that T meets a; therefore, appending (a, a) to T and taking each Gj as its 
own cycle provides a decomposition. 

Now suppose d — 5. We may decompose into three pieces again: a 
on {a,b}, a K n _ 2 , and n — 2 length 5 cycles Gi, . . . , G n _ 2 . Applying 
Theorem IToT we may break the K n _ 2 into cycles of length 5, except for one 
of length 4 and one of length 2, which we call T\ and T 2 , respectively. (This 
time, (n — 2)(n — 3) = 1 mod 5.) We may assume that T\ meets a and T 2 
meets b. Append (a, a) to 7\, \ (a, a) to T 2 , and take each Gj as its own 
element of the decomposition. 

Finally, let d = 7. We have the following decomposition of K n : a K 3 on 
{a, b, c}, a K n _ 3 , and n — 3 length 7 cycles Gi, . . . , G n _ 3 . Applying Theorem 
Hoi we may break the K n _ 3 into cycles of length 7, except for one of length 
5, which we call T' . (Now, (n — 3)(n — 4) = 5 mod 7.) We may assume that 
T' meets a. Append {a, b} to T', include the 7-cycle K 3 \ {a, b}, and take 
each Gj as its own element of the decomposition. □ 

All of these results together imply the following. 

Theorem 19. If d\n 2 and d > 3, then K n may be decomposed into cycles of 
length d. 

6 Conclusion 

We wish to know, first and foremost, what distinguishes valid sets from 
invalid ones. The authors' attempts to find a simple way to separate these 
cases was met with frustration. A simpler problem is the case of index sets 
which are arithmetic progressions: the question of decomposing de Bruijn 
graphs into cycles is a natural one. We would also like to see corresponding 
lower bounds or improvements on the bounds of Section EJ particularly in 
the case of Type I approximate cycles. 
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Finally, the problem solved by Theorem IT9*1 has a natural generalization 
along the lines of other, similar work. Suppose that {rrij} is such that 
Y2j m j = n 2 . When is it possible to decompose K n into cycles of lengths 
{rrij}? Clearly, there can be at most n l's among the rrij, and there must be 
at least n indices j for which rrij ^ 2. Are there additional restrictions? 
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